TEI exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Towards a TEI-based encoding scheme for the annotation of parallel texts

Internal identifier: 000076 (Main/Exploration); previous: 000075; next: 000077

Authors: Peter Boot [Netherlands]

Source: Literary and Linguistic Computing [0268-1145]; 2009-09, vol. 24, no. 3, pp. 347-361

RBID : ISTEX:C87E170B5B672608C2723A492673F975F0780E43

French descriptors

English descriptors

Abstract

Translation, adaptation, and other forms of appropriation of literary works can result in bodies of parallel texts. For the purpose of studying appropriation strategies, it is important to be able to annotate digital representations of these parallel text structures. This article uses early modern emblem culture (books of engravings or woodcuts, accompanied by mottos and explanatory texts) to investigate the forms this text parallelism may take. It defines requirements for annotation definition and proposes a TEI (Text Encoding Initiative) extension to implement these requirements. In the proposed encoding scheme, TEI feature structures will be used for storing annotation information. This scheme should be useful for annotating parallel text structures as well as for other annotation tasks. The annotation scheme assumes the annotated texts are available in XML. If this is not the case (there is no electronic version of the text at all or perhaps only a facsimile) the article suggests the definition of a TEI proxy document. A TEI proxy document contains enough of the structural aspects of the texts to serve as a basis for attaching annotations to the text. Outside of the annotation context, proxy documents may serve as a basis for adding functionality to image-based editions.
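
As a rough illustration of the idea described in the abstract, the sketch below shows how an annotation on a pair of parallel fragments might be stored as a TEI feature structure. It uses only standard TEI P5 elements (`<linkGrp>`, `<link>`, `<fs>`, `<f>`) and the global `ana` attribute. It is not the encoding scheme proposed in the article: the sample text, the `xml:id` values, the `type` values, and the feature names (`relation`, `literal`, `note`) are all assumptions made for this example.

```xml
<!-- Illustrative only: sample text, xml:id values, type values, and
     feature names are hypothetical and not taken from the article. -->
<text xmlns="http://www.tei-c.org/ns/1.0">
  <body>
    <!-- Two parallel fragments, here a motto and a translation of it -->
    <div type="original" xml:lang="la">
      <p xml:id="motto-la">Festina lente</p>
    </div>
    <div type="translation" xml:lang="en">
      <p xml:id="motto-en">Make haste slowly</p>
    </div>

    <!-- A link pairs the two fragments; ana points at the annotation -->
    <linkGrp type="parallel">
      <link target="#motto-la #motto-en" ana="#ann1"/>
    </linkGrp>

    <!-- The annotation itself, stored as a TEI feature structure -->
    <fs xml:id="ann1" type="appropriation">
      <f name="relation"><symbol value="translation"/></f>
      <f name="literal"><binary value="true"/></f>
      <f name="note"><string>Close rendering of the Latin motto.</string></f>
    </fs>
  </body>
</text>
```

Keeping the annotation in a separate `<fs>` and pointing to it from the link leaves the annotated texts unchanged, so the same mechanism could in principle point into a proxy document when no full XML transcription exists.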

Url:
DOI: 10.1093/llc/fqp023


Affiliations:



The document in XML format

<record>
<TEI wicri:istexFullTextTei="biblStruct">
<teiHeader>
<fileDesc>
<titleStmt>
<title>Towards a TEI-based encoding scheme for the annotation of parallel texts</title>
<author wicri:is="90%">
<name sortKey="Boot, Peter" sort="Boot, Peter" uniqKey="Boot P" first="Peter" last="Boot">Peter Boot</name>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">ISTEX</idno>
<idno type="RBID">ISTEX:C87E170B5B672608C2723A492673F975F0780E43</idno>
<date when="2009" year="2009">2009</date>
<idno type="doi">10.1093/llc/fqp023</idno>
<idno type="url">https://api.istex.fr/document/C87E170B5B672608C2723A492673F975F0780E43/fulltext/pdf</idno>
<idno type="wicri:Area/Istex/Corpus">000320</idno>
<idno type="wicri:Area/Istex/Curation">000320</idno>
<idno type="wicri:Area/Istex/Checkpoint">000045</idno>
<idno type="wicri:explorRef" wicri:stream="Istex" wicri:step="Checkpoint">000045</idno>
<idno type="wicri:doubleKey">0268-1145:2009:Boot P:towards:a:tei</idno>
<idno type="wicri:Area/Main/Merge">000076</idno>
<idno type="wicri:source">INIST</idno>
<idno type="RBID">Francis:11-0223733</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000009</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000036</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000005</idno>
<idno type="wicri:explorRef" wicri:stream="PascalFrancis" wicri:step="Checkpoint">000005</idno>
<idno type="wicri:doubleKey">0268-1145:2009:Boot P:towards:a:tei</idno>
<idno type="wicri:Area/Main/Merge">000108</idno>
<idno type="wicri:Area/Main/Curation">000076</idno>
<idno type="wicri:Area/Main/Exploration">000076</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title level="a">Towards a TEI-based encoding scheme for the annotation of parallel texts</title>
<author wicri:is="90%">
<name sortKey="Boot, Peter" sort="Boot, Peter" uniqKey="Boot P" first="Peter" last="Boot">Peter Boot</name>
<affiliation wicri:level="1">
<country xml:lang="fr">Pays-Bas</country>
<wicri:regionArea>Huygens Institute, Royal Netherlands Academy of Arts and Sciences, The Hague</wicri:regionArea>
<wicri:noRegion>The Hague</wicri:noRegion>
</affiliation>
<affiliation wicri:level="1">
<country wicri:rule="url">Pays-Bas</country>
</affiliation>
</author>
</analytic>
<monogr></monogr>
<series>
<title level="j">Literary and Linguistic Computing</title>
<idno type="ISSN">0268-1145</idno>
<idno type="eISSN">1477-4615</idno>
<imprint>
<publisher>Oxford University Press</publisher>
<date type="published" when="2009-09">2009-09</date>
<biblScope unit="volume">24</biblScope>
<biblScope unit="issue">3</biblScope>
<biblScope unit="page" from="347">347</biblScope>
<biblScope unit="page" to="361">361</biblScope>
</imprint>
<idno type="ISSN">0268-1145</idno>
</series>
<idno type="istex">C87E170B5B672608C2723A492673F975F0780E43</idno>
<idno type="DOI">10.1093/llc/fqp023</idno>
<idno type="ArticleID">fqp023</idno>
</biblStruct>
</sourceDesc>
<seriesStmt>
<idno type="ISSN">0268-1145</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Computational linguistics</term>
<term>Corpus annotation</term>
<term>Feature structure</term>
<term>Markup language</term>
<term>Parallel corpus</term>
<term>TEI</term>
<term>Text structure</term>
<term>Translation</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Annotation de corpus</term>
<term>Corpus parallèle</term>
<term>Langage de balisage</term>
<term>Linguistique informatique</term>
<term>Structure de traits</term>
<term>Structure textuelle</term>
<term>TEI</term>
<term>Traduction</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Traduction</term>
</keywords>
</textClass>
<langUsage>
<language ident="en">en</language>
</langUsage>
</profileDesc>
</teiHeader>
<front>
<div type="abstract">Translation, adaptation, and other forms of appropriation of literary works can result in bodies of parallel texts. For the purpose of studying appropriation strategies, it is important to be able to annotate digital representations of these parallel text structures. This article uses early modern emblem culture (books of engravings or woodcuts, accompanied by mottos and explanatory texts) to investigate the forms this text parallelism may take. It defines requirements for annotation definition and proposes a TEI (Text Encoding Initiative) extension to implement these requirements. In the proposed encoding scheme, TEI feature structures will be used for storing annotation information. This scheme should be useful for annotating parallel text structures as well as for other annotation tasks. The annotation scheme assumes the annotated texts are available in XML. If this is not the case (there is no electronic version of the text at all or perhaps only a facsimile) the article suggests the definition of a TEI proxy document. A TEI proxy document contains enough of the structural aspects of the texts to serve as a basis for attaching annotations to the text. Outside of the annotation context, proxy documents may serve as a basis for adding functionality to image-based editions.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Pays-Bas</li>
</country>
</list>
<tree>
<country name="Pays-Bas">
<noRegion>
<name sortKey="Boot, Peter" sort="Boot, Peter" uniqKey="Boot P" first="Peter" last="Boot">Peter Boot</name>
</noRegion>
<name sortKey="Boot, Peter" sort="Boot, Peter" uniqKey="Boot P" first="Peter" last="Boot">Peter Boot</name>
</country>
</tree>
</affiliations>
</record>

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Ticri/explor/TeiVM2/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000076 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000076 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Wicri/Ticri
   |area=    TeiVM2
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     ISTEX:C87E170B5B672608C2723A492673F975F0780E43
   |texte=   Towards a TEI-based encoding scheme for the annotation of parallel texts
}}

This area was generated with Dilib version V0.6.31.
Data generation: Mon Oct 30 21:59:18 2017. Site generation: Sun Feb 11 23:16:06 2024